Creators/Authors contains: "Toutios, Asterios"

  1. Individuals who have undergone treatment for oral cancer often exhibit compensatory behavior in consonant production. This pilot study investigates whether compensatory mechanisms used in the production of speech sounds with a given target constriction location vary systematically depending on target manner of articulation. The data reveal that compensatory strategies used to produce target alveolar segments vary systematically as a function of target manner of articulation in subtle yet meaningful ways. When target constriction degree at a particular constriction location cannot be preserved, individuals may leverage their ability to finely modulate constriction degree at multiple constriction locations along the vocal tract.
  2. There is a lack of general agreement among previous studies (e.g., Bakst, 2016; Dediu & Moisik, 2019; Westbury et al., 1998) on whether measurements of vocal tract morphology are robust predictors of inter-speaker variation in tongue shaping for American English /ɹ/. One possible reason is the different quantifications of /ɹ/ tongue shapes that were employed. The current study compares the relationships between a single set of anatomical measurements and three different measures of lingual articulation for /ɹ/ in /ɑɹɑ/ in midsagittal real-time MRI data. A novel method was developed to quantify the palatal constriction location and length, which served as the first two measures of tongue shape. A linear Support Vector Machine divided the constriction location and length measures into regions that approximate the visually identified categories of “retroflex” and “bunched.” The third shape measurement is the signed distance of each token of /ɹ/ to the division boundary, representing the degree of “retroflexion” or “bunchedness” based on palatal constriction properties. These three measures showed marginally to moderately significant linear relationships with two specific measures of individual speakers’ vocal tract anatomy: the degree of mandibular inclination and the length of the oral cavity roof. Overall, the effect of anatomy on the lingual articulation of /ɹ/ is not strong. [Work supported by NSF, Grant 1908865.]
  3. The theory of Task Dynamics provides a method of predicting articulatory kinematics from a discrete, phonologically relevant representation (“gestural score”). However, because the implementations of that model (e.g., Nam et al., 2004) have generally used a simplified articulatory geometry (Mermelstein et al., 1981) whose forward model (from articulator to constriction coordinates) can be analytically derived, quantitative predictions of the model for individual human vocal tracts have not been possible. Recently, methods of deriving individual speaker forward models from real-time MRI data have been developed (Sorensen et al., 2019). This has further allowed development of task dynamic models for individual speakers, which make quantitative predictions. Thus far, however, these models (Alexander et al., 2019) could only synthesize limited types of utterances due to their inability to model temporally overlapping gestures. An updated implementation is presented, which can accommodate overlapping gestures and incorporates an optimization loop to improve the fit of modeled articulatory trajectories to the observed ones. Using an analysis-by-synthesis approach, the updated implementation can be used (1) to refine the hypothesized speaker-general gestural parameters (target, stiffness) for individual speakers, and (2) to test different degrees of temporal overlap among multiple gestures, such as those in a CCVC syllable. [Work supported by NSF, Grant 1908865.]
  4. Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. However, easy access to RT-MRI is limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 participants performing linguistically motivated speech tasks, alongside the corresponding public domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each participant.
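
The third shape measure in item 2 (the signed distance of each /ɹ/ token to a linear division boundary) can be illustrated with a minimal sketch. It assumes a boundary has already been fitted and is given as a weight vector and bias over two features, constriction location and constriction length. The function names, the weights, the bias, and the choice of which side counts as "retroflex" are all hypothetical and are not taken from the study.

```python
import math


def signed_distance(point, weights, bias):
    """Signed Euclidean distance from `point` to the line w.x + b = 0.

    The sign tells which side of the boundary the point lies on;
    the magnitude tells how far from the boundary it is.
    """
    norm = math.sqrt(sum(w * w for w in weights))
    if norm == 0:
        raise ValueError("weights must not all be zero")
    score = sum(w * x for w, x in zip(weights, point)) + bias
    return score / norm


def label(point, weights, bias):
    """Assumption: the non-negative side is called 'retroflex'."""
    return "retroflex" if signed_distance(point, weights, bias) >= 0 else "bunched"


# Hypothetical boundary and tokens in arbitrary units, for illustration only.
WEIGHTS = (3.0, 4.0)   # (constriction location, constriction length)
BIAS = -5.0

tokens = [(3.0, 4.0), (0.0, 0.0), (1.0, 0.5)]
for token in tokens:
    print(token, signed_distance(token, WEIGHTS, BIAS), label(token, WEIGHTS, BIAS))
```

Dividing by the norm of the weight vector turns the raw decision score into a geometric distance, so values are comparable across tokens regardless of how the boundary is scaled.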